PERMUTATIONS ALL OF WHOSE PATTERNS OF A GIVEN LENGTH 

ARE DISTINCT 



PETER HEGARTY 

Abstract. For each integer k > 2, let F(k) denote the largest n for which there 
exists a permutation a € S„ all of whose patterns of length k are distinct. We prove 



3 



that F(k) = k + [\/2k — 3J + tf., where £ k <E {— 1, 0} for every k. Suggestions for 
£h ■ further investigations along these lines are discussed. 



0. Notation 

If /, g : N — > M+ are two functions, we write f(x) < g(x), or alternatively g(x) > 
U ! f(x), to denote that lim sup n ^ oc 44 < 1. 

^ ' As is usual in combinatorics, we will use interval notation for sets of integers. Hence, 

for real numbers a < b, the closed interval [a, b] consists of all integers n such that 
a < n < b, and so on. The interval [1, n\ will be denoted simply by [n]. Let S n denote 
the symmetric group on n letters. We will consider elements of S n as bijections a : 
[n] — > [n], and use the shorthand a = o\Oi ■ ■ ■ a n to denote that a(i) = The number 
n is called the length of the permutation. Some further more specialised notation will 
\q • be introduced below. 

o\ : 

1. Introduction 

vo - 

Let k, n be positive integers with k < n. If a 6 S n and tx G Sk, then one says that a 
contains n as a pattern if there exists a A>tuple (ax, a*), with 1 < ai < a 2 < • • ■ < 
a k < n and 



sign(cr aj . - o- ai ) = sign(7r i - 7r 4 ), for all 1 < z < j < k. (1.1) 

Later on, we will need a special notation to distinguish formally between restrictions of 
a and the patterns they represent, so we may as well introduce this notation now. Let 
(ai, a k ) and (a[, a' k ) be two /c-tuples as above, and let a 1 , o 2 be the corresponding 
restrictions of a, which are partial functions from [n] to [n]. Thus, for example, a 1 is the 
function from {a 1; a k } to [n] such that cr 1 (a j) = a ai . We write, somewhat informally, 
a 1 = a ai a a2 ■ ■ ■ a ak , and similarly for cr 2 . We shall use the notation a 1 C cr to indicate 
that cr 1 is a restriction of cr. If n E S k is the pattern represented by cr 1 , we will abuse 
notaton slightly and also write n C cr. 

The important distinction is the following: we write cr 1 = cr 2 if cij = ci^ for i = 
1, k, whereas we write cr 1 ~ cr 2 if they yield the same pattern, i.e.: if (1.1) holds for 
both the cij and the a[ and for the same n E S k . In the latter case one says that a 1 and 
cr 2 are pattern isomorphic as restrictions of cr. 
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The study of permutation patterns has developed rapidly over the last 20 years or so: 
see, for example, the recent book of Kitaev [Kj for a comprehensive overview of the 
literature. Much of the research undertaken is concerned with one or other of two com- 
plementary themes : 

Pattern avoidance : Here one is interested in enumerating, as a function of n, per- 
mutations in S n which contain no copies of a fixed set of one or more patterns, often of 
a fixed length k. 

Pattern packing : Here one is interested in constructing permutations which contain as 
many copies as possible of one or more fixed patterns, or alternatively, which contain 
as many different patterns as possible. 

There is by now a rather vast literature on the subject of pattern avoidance. Generating- 
function and other techniques allow for precise enumeration of permutations which 
avoid specific patterns, and also enable connections to be established to other kinds of 
permutation statistics. These results are quantitative in nature. Interesting qualitiative 
results are rarer, but there have been a number of notable achievements: for example, the 
exponential-growth result of Marcus and Tardos [MT], and the Kaiser- Klazar theorem 
HKKH establishing the dichotomy between exponential and polynomial growth. 

The literature on pattern packing is smaller but still substantial. For an introduction 
to the subject of packing copies of a specific pattern, see HAAHHS H. The paper of 
Miller [|M1 contains state-of-the-art results on the subject of permutations which contain 
as many different patterns as possible. It provides the best estimates to date for the 
following two natural functions : 

(i) the maximum number pat(n) of possible patterns, of unspecified length, in a per- 
mutation of length n. Miller proves that 

2 n - 0{n 2 2 n -^) < pat(n) < 2 n - e(n2 n ~^). (1.2) 

(ii) the minimum length L(k) of a so-called k-superpattern, i.e.: a permutation which 

contains every ir e 3*. as a pattern. Since ( J > k\, Stirling's formula gives a 

trivial lower bound of 2 

L(k) > (~) . (1.3) 

Nobody has yet succeeded in improving on this estimate. Miller obtained the best 
upper bound to date. She exhibited, for every k, a A;-superpattern whose length is at 
most k(k + l)/2. 

As Miller remarks in her paper, the problems of estimating the functions in (i) and 
(ii) above are, loosely speaking, "dual to one another". When reading this, it occurred 
to us to consider the following notion, which seems more directly "dual" to the notion 
of a superpattern : 

Definition 1.1. Let k, n be natural numbers with k < n. A permutation a £ S n is 
called a k-separator if it contains at most one copy of any n E Sk- 
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We found no explicit mention of this concept in the existing literature. The obvious 
object to study would seem to be the function F : N — > N, where F(k) denotes the 
maximum length of a /c-separator. The trivial lower bound for L(k) in (1.3) now trans- 

f F(k) \ 

lates into a trivial upper bound for F{k). For one must have I ) j < k\ and hence, 
by Stirling's formula, 

F(k)<(-) . (1.4) 



e t 

However, the property of being a A;-separator is far more restrictive than this. Indeed, it 
is almost trivial that 

F(k) < 2k. (1.5) 

To see this, let a E S n , a = a\ • • ■ a n , be a /c-separator. A priori, there are at most 
k possiblilites for the pattern formed by G\G2 ■ ■ ■ g^-iGt, as T runs from k up to n. 
Hence, if n > 2k, at least two of these patterns must coincide. 
The main result of our note is the following : 

Theorem 1.2. For each k > 2 one has 



F(k) = k+ [V2k - 3J + e fe , (1.6) 

where e k E { — 1, 0}. 

The proof of this result, which follows in Section 2, has much in common with the 
methods of AM]- To obtain a lower bound for F(k), we employ the same "tilted checker- 
board" permutations appearing in [Ml- For the upper bound, we further extend the 
idea employed in llMll. and attributed originally to Coleman [C], that to avoid repeat- 
ing patterns in a permutation a E S n , the so-called taxicab distance between elements 
i,j E [n] should be large. What seems to be new in our proof is a sort of optimisa- 
tion argument which allows for a very precise estimate for F(k) (there are hints of this 
argument in Section 6 of [MJ, but our approach seems to be different). 

Our paper closes with a short discussion section (Section 3), which includes some 
suggestions for extending the ideas introduced here. 

2. Proof of Theorem 1.2 
2.1. Proof of Lower Bound. In this subsection we prove that, for every k > 2, 



F(k) >k + [V2k — 3J — 1. (2.1) 

Note that 2 k — 3 is a perfect square if and only if k = 2m 2 — 2m + 2 for some m > 1. 
To prove (2.1) it thus suffices to prove, for every integer m > 1, the following two 
statements : 



(i) If 2m 2 — 2m + 2 < k < 2m 2 + 1, then there exists a E Sk+(2m-2) which is a 
fc-separator, 

(ii) If 2m 2 + 2 < k < 2m 2 + 2m + 1, then there exists a E Sk+(2m-i) which is a 
/c-separator. 
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For positive integers r, s, we employ the definitions of the r x s titled rectangle and 
the r x s titled checkerboard as given in [M jH Let a r > s denote the corresponding r x s 
checkerboard permutation, considered as a permutation of length [y] . Thus, r is the 
number of columns and s the number of rows in this permutation. If a 6 [r] and 6 G [s] 
are such that a = b (mod 2), then cr^ denotes the element of the r x s checkerboard 
which lies in its a:th column and 6:th row. Here, the columns are read from left to right 
and the rows from bottom to top. 

CASE 1: 2m 2 - 2m + 2 < k < 2m 2 + 1. 

We consider three subcases : 

(i.a) k = 2m 2 - 2m + 2. 

(i.b) k = 2m 2 — 2m + 2 + i, for some 1 < i < m. 
(i.c) k = 2m 2 — m + 2 + j, for some 1 < j < m. 

Subcase ( i.a): We need to exhibit a 6 S 2 m 2 which is a /c-separator. We take a = a 2m ' 2m . 
This is indeed a permutation of length |(2m) 2 = 2m 2 . By Proposition 4.4 of flM), a 
pattern n C a which is not uniquely represented must truncate or avoid at least two 
of the rows and/or columns of a. But, clearly, any such n must omit at least 2m — 1 
elements of the checkerboard. 

Subcase (i.b): We need to exhibit a E S 2m ?+i which is a /c-separator. We take a to 
be the prefix of a 2m+1 ' 2m whose complement consists of its last m + 1 — i elements, 
i.e.: we omit the elements cr^m+i'b" 1 for b > 2i + 1. We need to show that any 7r C a 
which omits 2m — 2 elements of a is represented uniquely. We already know this is true 
when i = from subcase (i.a) above. Now suppose i > and suppose ir 1 , n 2 are two 
restrictions of a such that n 1 ~ n 2 and each omits 2m — 2 elements of a. If n 1 ^ n 2 
then, reading both from left-to-right, there must be a first position where they differ. 
Say this is in position £ e [k\ and let irj = a a b , 7r| = a c d . Since both tx 1 and n 2 omit 
only 2m — 2 elements of the 2m x 2m checkerboard formed by all but the last column 
of a, at least one of a and c must equal 2m + 1. 

First suppose a = c = 2m + 1 and, WLOG, that b < d. Then 

o"2m+i b < o'ef < o"2m+i d, whenever e < 2m + 1, b < f < d and e = / (mod 2). 

(2.2) 

Since d > b + 2, there are at least 2m such elements a e j. Since n 1 and n 2 coincide 
to the left of position £, none of these 2m elements of a can lie in either t\K But this 
contradicts the assumption that the 7r J omit only 2m — 2 elements of a. 

Now suppose, WLOG, that 2m + 1 = c > a. Then all elements a e j must be missing 
from 7r 2 , where either (i) e = a, f > b (ii) a < e < 2m + 1 (iii) e = 2m + 1, f < d 
(iv) e < 2m + 1, d < f < b or (v) e < a, b < f < d (note that either (iv) or (v) is 
unsatisfiable). It is easy to see that the total number of pairs (e, /) of the same parity 
satisfying at least one of these conditions must then be at least 2m, again a contradiction. 



Miller already has some nice pictures as visual aids, so we do not reproduce these here. 
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Subcase (i.c): We need to exhibit a G 5 , 2m 2 +2m+: ,- which is a ^-separator. We take a to 
be the restriction of a 2m + l > 2rn + l whose complement consists of the leftmost m + 1 — j 
elements in its top row, i.e.: we omit the elements ^2^2m+i ' wnere 1 < £ < m+1— j. 
We need to show that any 7r C a which omits at 2m— 2 elements is represented uniquely. 
We already know this is true when j = from subcase (i.b) above. For j > 0, the ar- 
gument is essentially the same as in subcase (i.b), for we can rotate a by 90 degrees 
clockwise and apply a symmetry argument. We shall flesh out the details a little so as 
to leave no room for doubt. Suppose ir 1 , n 2 are two restrictions of a such that n 1 ~ n 2 
and each omits 2m — 2 elements of a. Firstly, if n 1 and n 2 coincide along the top row of 
a, then it is easy to see that the restrictions of both to the remaining rows must also be 
pattern isomorphic. Then we can apply subcase (i.b) directly. So we may suppose that, 
reading from left-to-right, there is a first position along the top row of a, say cr ,2m+i> 
such that, WLOG, a a ,2m+i G 7r 1 \7r 2 . Suppose a a ,2m+i is the t:th largest element in 
7T 1 . If the t:th largest element in it 2 appears in the top row of a, then it must be in 
the (a + 2):nd column or later. Since n 1 ~ n 2 , the same number of elements appear 
in both to the left of the t:th largest element. It follows unavoidably that ir 2 omits at 
least 2m elements of a, a contradiction. A similar argument can be applied if the t:th 
largest element of n 2 doesn't appear in the top row. We will unavoidably be led to the 
contradiction that n 2 omits at least 2m elements of a. 

CASE 2: 2m 2 + 2 < k < 2m 2 + 2m + 1. 

We consider three subcases : 

(ii.a) k = 2m 2 + 2. 

(ii.b) k = 2m 2 + 2 + i, for some 1 < i < m. 
(ii.c) k = 2m 2 + m + 2 + j, for some 1 < j < m. 

Subcase (ii.a): We take a = a 2m+1 > 2m+1 , which has length [ (2m + 1)2 ] = 2m 2 + 2m + 1. 
We need to show that any restriction which omits at most 2m — 1 elements of a uniquely 
represents its pattern. Let n 1 and n 2 be two such restrictions and suppose n 1 ~ n 2 , but 
7T 1 7^ 7r 2 . We can argue as in subcase (i.c) above that, unless n 1 and n 2 both contain 
the entire top row of a, at least one of them necessarily omits at least 2m elements of 
er, a contradiction. Hence, both contain the entire top row. By a symmetry argument, 
both must also contain the entire first column. Hence, neither can truncate a row or 
column of a. By Proposition 4.4 of [MJ, the only way left for 7T 1 and 7r 2 to represent 
the same pattern is for each to omit an even numbered row and column, and nothing 
else. But because each of n 1 and n 2 contains the entire top row, it is then easy to see 
that we cannot get the same pattern unless they omit exactly the same row and column, 
and hence are after all equal. 

We can deal with subcases (ii,b) and (ii.c) in exactly the same way as we did with 
(i.b) and (i.c) respectively, except that now we start with ff 2m + 1 ^ 2m + 1 and add in the 
elements of the last column and top row of a 2m + 2 < 2m + 2 one-by-one, first going from 
bottom to top in the last column and then from right to left in the top row. As in Case 1, 
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this shows that the value of F(k) — A; is non-decreasing for k in the interval covered by 
Case 2. 

2.2. Proof of Upper Bound. In this subsection we prove that, for every k > 2, 

F(k) <k + [V2fc-3J. (2.3) 

The basic idea is that a ^-separator must take numbers which are close together and 
permute them so they are placed far apart. We now make this precise. We need a couple 
of definitions: 

Definition 2.1. Let a e S n and let i,j e [n\. The distance between % and j in a, 
denoted d a (i,j), is defined as 

d ff (i,j) = \a- 1 (i)-a- 1 {j)\. (2-4) 

In other words, d a (i,j) is the number of spaces between i and j in the representation 
a = 0\02 • • ■ a n . For example, if n — 9 and er = 341679825, then d a {A, 2) = 6 and 
d ff (l,9)=3. 

Definition 2.2. Let a e S n and let i, j e [n]. We define the natural number tfj to 
be the number of integers / such that 

sign(Z-i) = sign(j-Z) and sign^" 1 ^) -a" 1 (i)) = sign^" 1 ^) -^ _1 (0)- ( 2 - 5 ) 

In other words, tfj is the number of integers lying strictly between i and j which also 
appear between i and j in the representation a = GiG 2 - • • <r n . For example, if n = 9 
and a = 341679825, then t\ 8 = 3, since each of the numbers 4, 6, 7 appear between 3 
and 8 in a, whereas t\ h = 1, since only 2 appears between 1 and 5 in a. Note that, a 
priori, for any i, j and a one has 

(k/;:, /••./• (2.6) 

We now require three lemmas : 

Lemma 2.3. Le? a e S n be a k-separator. Then for any % ^ j G [n] one 

j) >(n-k + 2)+tl j -\i-j\. (2.7) 

Proof. Without loss of generality, i < j and i appears to the left of j in the standard 
representation of a. We consider 

a — a 1 ■ ■ -a r -i i a r +i • • -Vs-i j cr s+ i ■ ■ ■ a n . (2.8) 

By definition, tfj of the numbers in the interval appear among oy+i, cr s _i. 
Hence, the remaining j — i — 1 — ^ ■ such numbers appear either to the left of i or to 
the right of j in (2.8). A total of n — d a (i, j) — 1 numbers appear either to the left of i 
or to the right of j. Hence, exactly 

(n - d„(i,j) - 1) - (j - i - 1 - t? d ) =n + tl 3 - (j -i)- d a (i,j) (2.9) 

of these numbers are not in the closed interval If (2.7) failed to hold, it would 

mean that the right-hand side of (2.9) was greater than or equal to k — 1. In other words, 
it would mean that at least k — 1 of the numbers appearing either to the left of i or to 
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the right of j in a were not in the interval If so, pick any k — 1 such numbers 

reading from left to right, say cr^, a i2 , a ip , a^, aj q , where p + q = k — 1 and 
ii < i 2 < ■ ■ ■ < i p < r < s < ji < ■ ■ ■ < j q . Then the two subsequences 

a k ■ ■ ■ a ip i a h ■ ■ ■ a jq and a h ■ ■ ■ a ip j o h ■ ■ ■ cr jq (2.10) 

yield two copies of the same length-/c pattern in a, contradicting the fact that a is a 
^-separator. This completes the proof of the lemma. 

Remark 2.4. Let a G S n and i, j £ [nj. The taxicab distance between i and j with 
respect to a, which we denote dj (i, j), is defined in Section 6 of [|Ml as 

dJ C (i,j) = li-jl + lcTi-ajl. (2.11) 

Also, in Millers' notation, 

^ = n-\S a (i,j)\. (2.12) 
Hence, in Miller's notation, (2.7) becomes 

dJ9 1 (iJ)>2n + 2-k-\S a (iJ)\. (2.13) 

Lemma 2.5. Let m £ N and to aia 2 ■ ■ ■ a m be any permutation of the integers in [m]. 
Then 

E(m — l)(m + 1) 
k - «ml < § -■ (2-14) 

i=i 1 

Proof. Let 

r := #{z G [m - 1] : a i+1 < a^}, s := m - 1 - r = #{z G [m - 1] : a m > a^}. 

(2.15) 

Let ii, z r be the indices such that a ij+ i < a ip for j = 1, .., r. Since the "take-off 
points" are distinct, for j = 1, .., r, and also the "landing points" a ij+i are distinct, 
for j = 1, r, it follows that 

r r 

22 Is - a ij+i\ < r ( m - 1) - 2 • )Xj - 1) = r(m - 1) - r(r - 1). (2.16) 

o=i i=i 

Similarly, if i' s are the indices such that <v + i > a^, for j = 1, s, then 

s 

|(ty. — aj'.+i| < s(m — 1) — s(s — 1). (2.17) 

j'=i 

From (2.16) and (2.17) it follows that 

TO— 1 

|a { - a i+1 \ < m(m - 1) - (r 2 + s 2 ). (2.18) 

i=l 

Since r + s = m — 1, the quantity r 2 + s 2 is minimised when m is odd and r = s = 
This proves the lemma. 
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Remark 2.6. A more careful analysis yields 



m— 1 



E(m — l)(m+ 1) „ 
k - ai+il < 1 j " " !> ( 2 - 19 ) 

i=l 

which is best-possible. We have no use for this slight improvement in what follows, 
however. 

Lemma 2.7. For any igN one has 

Lv^J+t^<2(Lv^J + 1), (2.20) 
with equality if and only if x = n 2 — I for some nGN. 
Proof. This is a simple exercise. 

We are now ready to prove (2.3). Let a e S n be a /c-separator. Since the numbers i£ ■ in 
(2.5) are, at the very least, non-negative, we have by Lemma 2.3, for any i ^ j e [n], 
that 

> (n-fc + 2)-|i-j|. (2.21) 

Let m G [n] . The value of m will be optimised in due course. Reading from left to right 
in the representation a — o\ ■ • • a n , the numbers from 1 to m will appear in some order, 
say as a ii: a im , where ii < • • • < i m . By (2.21), we have that 

m— 1 m— 1 

^d a (a iv a il+l ) > (m-l)(n-k + 2)-J2\ (T H- (T n + i\- ( 2 -22) 

i=i i=i 
The left-hand side of (2.22) cannot exceed n — 1, since we are reading along a from 
left to right. By Lemma 2.5, the sum on the right-hand side cannot exceed ( m ~ 1 K m+1 ) - 
Hence, 

n - 1 > (m - l)(n - * + 2) - ^if^ , (2.23) 
which we can rewrite as 

n< (fc-l) + I/ fc (m), (2.24) 

where 

2 A* — 3 

/ fc (m)=m + -. (2.25) 

m — 2 

Inequality (2.24) must hold for any choice of m E [n], so we choose m to make fk{rn) 
as small as possible. As a function of a real variable, fk(m) has a local minimum at 
m = \f2k — 3 + 2. Since, in our case, m must be an integer, we take 



f [V2A; - 3] + 2, if fc = 2m 2 + 1 for some u e N, 
m ~\ Lv / 2T^3j+2, otherwise. ( ' b) 

With the help of Lemma 2.7 one easily verifies that, for this choice of m one always has 

f k (m) < 2 (V2A;-3J + 2) (2.27) 
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which, together with (2.24), yields (2.3). 

3. DISCUSSION 

Since F(2) = 2 and F(3) = 4, one has e 2 = — 1 whereas e 3 = 0. We do not know any 
nice way to determine the value of e k in general. Indeed, we do not even know whether 
both —1 and occur infinitely often. It is natural to guess that any /c-separator of length 
F{k) must look very similar to those considered in subsection 2.1 , and hence basically 
look like a tilted checkerboard, minus some elements around the edges. However, we 
do not know any elegant approach to tackling these refinements of the extremal problem 
which has been considered in this article. 

There are other directions in which one might choose to extend the ideas presented 
here, so let us just make one of the more obvious suggestions. For each pair (k, I) of 
positive integers, let P(k, I) denote the maximum number of distinct patterns ir E S k 
which can appear in a permutation a e S k+ i and set 

Qik ' l) ^{¥)' <3 ' 1> 

Obviously, for any fixed k, Q(k, I) — > as / — >• oo. The basic question then is, how 
quickly does Q(k, I) go to zero ? Theorem 1.2 says that Q(k, I) = 1 at least for / < 

k + [V2k - 3J - 1. 
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